Maintaining Anti-Monotone Property for Generator with Weight and Its Mining Method

نویسندگان

  • Bingzheng Wang
  • Ran Liu
چکیده

Generator is a concise representation for frequent itemset. And it has the anti-monotone property as the frequent itemset does, which is an important property in real applications. But when itemsets are attached with weights to balance importances between themselves, the anti-monotone property of generator may not hold. Additionally generator with weight may become tough to be dealt with in many circumstances. In this paper, we adapt support weight calculation to generator definition under weight support framework through specific techniques. The anti-monotone property of generator with weight can be kept to facilitate mining works. A new method for mining generators with weights is proposed. It exploits depth-first mining strategy and prunes search space with little cost. Experimental results show that the proposed method runs properly and achieves good performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generator of Hypotheses – an Approach of Data Mining Based on Monotone Systems Theory

Generator of hypotheses is a new method for data mining. It makes possible to classify the source data automatically and produces a particular enumeration of patterns. Pattern is an expression (in a certain language) describing facts in a subset of facts. The goal is to describe the source data via patterns and/or IF...THEN rules. Used evaluation criteria are deterministic (not probabilistic). ...

متن کامل

Efficient Mining of High Utility Itemsets from Large Datasets

High utility itemsets mining extends frequent pattern mining to discover itemsets in a transaction database with utility values above a given threshold. However, mining high utility itemsets presents a greater challenge than frequent itemset mining, since high utility itemsets lack the anti-monotone property of frequent itemsets. Transaction Weighted Utility (TWU) proposed recently by researche...

متن کامل

Mining utility-oriented association rules: An efficient approach based on profit and quantity

Association rule mining has been an area of active research in the field of knowledge discovery and numerous algorithms have been developed to this end. Of late, data mining researchers have improved upon the quality of association rule mining for business development by incorporating the influential factors like value (utility), quantity of items sold (weight) and more, for the mining of assoc...

متن کامل

Scaling up cosine interesting pattern discovery: A depth-first method

This paper presents an efficient algorithm called CosMinert for interesting pattern discovery. The widely used cosine similarity, found to possess the null-invariance property and the anti-cross-support-pattern property, is adopted as the interestingness measure in CosMinert . CosMinert is generally an FP-growth-like depth-first traversal algorithm that rests on an important property of the cos...

متن کامل

Method for Extracting Valuable Common Structures from Heterogeneous Rooted and Labeled Tree Data

The most commonly adopted approach to find valuable information from tree data is to extract frequently occurring subtree patterns. Because mining frequent tree patterns has a wide range of applications such as XML mining, web usage mining, bioinformatics, and network multicast routing, many algorithms have been recently proposed to find the patterns. However, existing tree mining algorithms su...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JCP

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013